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NOVEL TRANSCRIPTIONAL REGULATORS AND USES THEREFOR 

Priority Claim 

The present application claims priority to United States provisional application 
number 06/056,857, filed August 27, 1997, the entire contents of which are incorporated 
herein by reference. 

Government Support 

The work described herein was supported by grants NIGHM22526, NIGHM52409, 
and NSFMCB9604458 for the United States Government; the United States Government may 
have certain rights in this invention. 

Background of the Invention 

Regulated gene expression controls cell differentiation and development. Many 
medical disorders can be traced to errors in the control of gene expression. Much of 
biological research is directed toward an understanding of the mechanisms of gene 
regulation. The holy grail in these studies is the ability to control gene expression. 

Very often, gene expression is regulated at the level of transcription. A great deal is 
known about the machinery involved in gene transcription, and about the factors that regulate 
it. Most transcriptional regulatory factors are proteins containing at least two functional 
domains: one that binds to a specific site in DNA and one that interacts, directly or 
indirectly, with some component of the transcriptional machinery to exert hs regulatory 
effect. These functional domains can typically be separated from one another without losing 
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their activities, and domains can be swapped from one regulator to another. Also, several 
knovra regulators that are naturally found in a particular organism (e.g., yeast) have been 
found to maintain their activity when introduced into cells of a different organism (e.g., 
human cells). These observations suggest that at least some transcriptional regulators employ 
5 mechanisms that have been conserved in evolution, and that work done in one organism is 
likely to be generalizable to others. 

Despite the large database of accumulated knowledge about the transcriptional 
machinery and the characteristics of activators, much remains to be leamed. For example, 
there is currently much debate over the mechanism by which regulators interact with and/or 
10 alter the activity of the transcriptional machinery. Also, there remains a need for the 
identification of new and different techniques for controlling gene expression. 

Definitions 

''Associated" The term "associated" is used herein to describe any physical or 
1 5 functional linkage between two moieties. The association may be constitutive or inducible. 
Moieties may be associated by covalent linkage, hydrogen bonding, van der Waals 
interactions, hydrophilic or hydrophobic interactions, or any other means that preserves 
functionality. Where polypeptide moieties are associated by covalent linkage, they form a 
"fusion protein". Similarly, where nucleic acid molecules are associated by covalent linkage 
20 (e.g., by means of a 3-5' phosphodiester bond, a 2-5' phosphodiester bond, or some other 
covalent bond), they form a "fusion molecule". Associations according to the present 
invention may be direct (i.e., may involve physical contact between or among the relevant 
moieties) or indirect (e.g., may involve one or more other compounds that mediate(s) the 
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interaction(s) between or among the moieties). One preferred method for producing an 
inducible association is to utilize a mediator molecule whose expression or production is 



itself inducible. 

"Isolated" A molecule or compound is "isolated" as that term is used herein if is 
separated from one or more molecules or compounds with which it is associated in nature. 
Also, any molecule or compound that itself is never produced in nature is by definition 
"isolated". Any molecule or compound that has been subjected to one or more purification 
steps is "isolated". Molecules or compounds produced in vitro are also "isolated". As used 
herein, a molecule or compound is "substantially purified" when it is at least about 90% pure. 

"Moiety" -- A "moiety", as that term is used herein, is a molecule or a portion of a 
molecule having a given activity or function. For example, many, if not most, known 
proteins that regulate transcription have at least two defined functional domains, one that 
directs DNA binding and one that confers regulation on genes operationally linked with a 
DNA site to which the protein binds. According to the present invention, the portion or 
portions of such a protein that are sufficient for DNA binding activity constitute a DNA 
binding moiety; the portion or portions sufficient for transcriptional regulation constitute a 
regulatory moiety. Those of ordinary skill in the art will appreciate that a moiety need not be 
a protein domain, nor a polypeptide of any sort. Any molecule, or collection of molecules, 
having the designated activity can constitute a moiety according to the present invention. 

"Non-naturally occurring" - The phrase "non-natural ly" occurring, when applied to 
RNA molecules herein, refers to RNA molecules that i) have a nucleotide sequence that is 
not found in the organism in which they are active; and/or ii) are in a form that is not found in 
nature. For example, an RNA molecule whose complete nucleotide sequence (i.e., the 
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sequence of the whole molecule) is not found within any natural RNA molecule endogenous 
to the cell in which the RNA is activating transcription is non-naturally occurring according 
to the present definition. Also, any RNA molecule that contains one or more sequence 
elements that are not naturally found in that cell is a non-naturally occurring RNA molecule. 
5 A "sequence element", as used herein, is any portion of an RNA molecule's sequence. All 
RNA molecules contain one or more sequence elements ( since a single "A", "U", "G", or 
"C" is a sequence element); non-naturally occurring RNAs of the present invention are 
distinguished by having one or more sequence elements that are not found in naturally- 
occurring RNAs in the cell in which the non-naturally occurring RNA is active. Alternative 

10 non-naturally occurring RNAs are those that, though their nucleotide sequence may be 

identical to that of an RNA found in nature, are in a form not found in nature. For example, 
any 'isolated" RNA is non-naturally-occurring. Also, any RNA molecule that has been 
modified so that it includes one or more chemical groups not found in naturally-occurring 
RNAs of the same sequence, or lacks one or more chemical groups naturally found in RNAs 

15 of the same sequence is non-naturally occurring according to the present invention. 

"Operationally linked" The phrase "operationally linked", as used herein to 
describe a site in a nucleic acid, means that the site is located in three-dimensional space in a 
position and orientation that allows a regulator binding to the site to exert its effects on 
transcription. As is known in the art, transcriptional regulatory sites are commonly linked by 

20 covalent association to the promoter whose expression they regulate. That is, regulator sites 
are typically part of the same DNA molecule as the promoter. In most cases, such sites are 
located upstream of the promoter, though examples of regulatory sites that are effective when 
positioned downstream of the promoter, either within or downstream of the gene, are known. 
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According to the present invention, covaient linkage is not essential for operational linkage 
between a regulatory site and a promoter. A regulatory site for use in accordance with the 
present invention may be operationally linked to a promoter, for example, by being provided 
as a separate DNA molecule or by being embedded within the transcript whose expression 
5 the promoter directs. 

Summary of the Invention 

The present invention provides novel transcriptional regulators comprised of 
ribonucleic acid (RNA). The RNA regulators of the present invention aher the rate and/or 

1 0 the extent of transcription from a promoter when they are delivered to a site that is 

operationally linked to that promoter. Those inventive RNA regulators that increase the rate 
or level of transcription from a given promoter are termed "riboactivators"; those that 
decrease the rate or level of transcription are termed "riborepressors". Preferred RNA 
regulators of the present invention are capable of association with a DNA binding moiety 

1 5 characterized by an increased affinity for a particular DNA site as compared with DNA 

generally. Particularly preferred regulators are characterized by an ability to interact 

specifically with a TATA-binding protein (TBP) present in the cells in which they are active. 

Certain preferred riboregulators have a nucleotide sequence that includes a region 

conforming to the consensus 5'-UGC(G>U>A)GG(U>A>C)(lJ>ACG)(C>A)(G>A>U)-3' 
20 (SEQ ID NO:4). 

Description of the DraAvings 
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Figure 1 presents the Wickens/Fieids three-hybrid system for identifying proteins that 
interact with known "bait" RNAs. 

Figure 2 presents an inventive two-hybrid system utilized to identify riboactivators. 

Figure 3 presents a Northwestern blot demonstrating a specific interaction between 
riboactivator number 7 and yeast TBP. 

Figure 4 presents a Northwestern blot demonstrating that riboactivator number 7 
binds to yeast TBP (lane 3), but not to yeast TFIID lacking TBP (lane 4), to mammalian 
TFIID (lane 2), or to Yeast RNA polymerase II holoenzyme (lane 5). 

Description of Certain Preferred Embodiments of the Invention 

Riboregulators 

Any RNA molecule, or portion of an RNA molecule, that alters transcription from a 
promoter when recruited to a regulatory site operationally linked with that promoter is a 
riboregulator of the present invention. Both riboactivators and riborepressors are 
encompassed within the present invention. Prefen-ed riboregulators affect transcription 
initiation, but RNAs that affect elongation, reinitiation, termination, and/or pausing are also 
included within the present invention. 

Preferred riboregulators of the present invention are active in vitro and/or in vivo. 
Preferably, the riboregulators are active in one or more cell types selected from the group 
consisting of bacterial cells, yeast cells, mammalian cells, insect cells, plant cells, reptile 
cells, celenorate cells, and protozoan cells. Particularly preferred riboregulators are active in 
yeast, mouse, and/or human cells. Preferred riboregulators of the present invention are active 
at more than one promoter in a particular cell (i.e., affect transcription at various promoters 
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when recruited to sites operationally linked to those promoters). Particularly preferred 
riboregulators are active both in vivo and in vitro. 

According to the present invention, preferred riboregulators affect transcription by the 
same molecular mechanism utilized by one or more naturally-occurring protein factors that 
regulate transcription. For example, one class of naturally-occurring transcriptional 
activators, known as "acidic activators'', is thought to stimulate transcription by contacting 
one or more components of the general transcription machinery and recruiting that machinery 
to the promoter. Although the details of this mechanism are not well understood, a 
"squelching" assay has been developed to identify activators that fall within this mechanistic 
category. The term "squelching" refers to the fact that, when a transcriptional activator is 
expressed at high levels in a cell in which u is active, the effect may be to reduce gene 
transcription in that cell, sometimes even to the point of killing the cell. Current models 
explain this observation by hypothesizing that non-DNA-bound molecules of the highly- 
expressed activator compete with DNA-bound activators for interaction with the same target 
or targets in the general transcription machinery, so that there is less target available for 
interacdon with DNA-bound activators, the transcription machinery is recruited to promoters 
less frequently, and transcription is therefore reduced. For purposes of the present invention, 
any riboactivator that squelches a known acidic activator is classified as one that works by 
the "acidic activator mechanism". Preferred riboactivators of the present invention squelch, 
or are squelched by, Gal4 when expressed in yeast and/or in mammalian cells. 

In certain preferred embodiments of the invention, the riboregulator includes a stem- 
loop structure. Particular embodiments of preferred stem-loop containing riboregulators are 
described in Example 2. 
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Riboregulators of the present invention may be constitutively active or may 
alternatively be active only under specified conditions. For example, the techniques and 
methods described herein may readily be employed to select riboregulators that exert their 
effects on transcription only in the presence (or absence) of a chemical molecule, a 
modification enzyme (e.g., a methylase), and/or some component of the growth medium. 
Riboregulators that are active only in the presence (or absence) of antibiotics may be 
particularly easy to identify since several RNA-binding antibiotics are known. The 
techniques of the present invention may also be employed to identify, for example, 
temperature-sensitive riboregulators. All such conditional riboregulators are encompassed 
within the scope of the present invention. 



DNA binding moieties 

As discussed above, inventive riboregulators are RNA molecules that alter the rate or 
extent of transcription from a promoter when recruited to a regulatory site operationally 
linked to that promoter. Typically, such recruitment is accomplished by association with a 
DNA binding moiety. Any chemical compound may be utilized as a DNA binding moiety 
providing that it has sufficient affinity for DNA to recruit the riboregulator to a regulatory 
site and also has the ability to form an association with the riboregulator. For example, 
nucleic acids, polypeptides, intercalation compounds, and/or any chemical compounds that 
bind to DNA may be utilized as DNA binding moieties. Preferably, the DNA binding moiety 
is characterized by an increased affinity for a particular regulatory site as compared in DNA 
generally, so that the DNA binding moiety is said to display sequence-specific DNA binding 
as that term is understood in the art. 
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The preferred means by which the riboregulator is associated with the DNA binding 
moiety depends on the chemical nature of the moiety. For example, where the DNA binding 
moiety is a DNA or RNA molecule, the riboregulator may be associated with the moiety 
through covalent linkage (e.g., via a 3-5' or a 2*-5' phosphodiester bond). In certain preferred 
5 embodiments, the riboregulator is produced as a single RNA molecule with the DNA binding 
moiety. Alteratively, the riboregulator may be associated with a nucleic acid DNA binding 
moiety through base-pair interactions or other three-dimensional nucleic acid interaction. 
Where the DNA binding moiety is a polypeptide or a chemical compound, the riboregulator 
may be covalently linked to the moiety, but is preferably associated by non-covalent 
1 0 interaction. 



Applications 

Those of ordinary skill in the art will recognize that the riboregulators of the present 
invention are useful in a wide variety of contexts. For example, they are useful tools in the 
efforts to understand the mechanism of gene regulation. They are also useful as reagents for 
identifying and dissecting RNA-RNA and RNA-protein interactions. Furthermore, the 
inventive riboregulators are useful as agents for controlling gene expression. In one 
particular embodiment of the present invention, the inventive riboregulators may be used as 
therapeutic agents to modulate gene expression in vivo in order to alleviate or correct a 
disease state. Certain preferred embodiments of these applications are discussed in more 
detail below. 

Experimental tools 
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Much effort in the field of transcriptional regulation has been directed toward the 
identification of the interaction target for the protein regulators that activate and repress 
transcription in vivo. The riboregulators of the present invention provide valuable new tools 
for this endeavor. Riboregulators that act by the same mechanism as natural transcriptional 
regulators are particularly useful in this regard. One advantage of riboregulators, as 
compared with protein factors, for these types of studies is that a variety of RNA-specific 
techniques are available that can be employed to identify and characterize any interaction 
partners. For example, riboregulators can be readily cross-linked to any interaction target or 
targets with which they physically interact. Techniques for cross-linking single-stranded 
RNAs with nearby proteins are well known in the art. Techniques for cross-linking double- 
stranded RNAs, which techniques are useful for riboregulators containing stem-loop or other 
double-stranded secondary structures, have recently been developed (see, for example, Liu et 
al., in Abstracts of Papers Presented at the 1997 Meeting on Eukaryotic mRNA Processing, 
Cold Spring Harbor Laboratory, August 20-24, 1997, pg. 82). Other usefiil RNA-specific 
techniques include, for example, Northwestems, gel mobility shift assays, filter binding 
assays, etc. (see Example 4). Also, complexes containing an RNA molecule can be detected 
by reverse transcription and amplification. 

Riboregulators of the present invention may also be used in novel multiple-hybrid 
experiments to identify RNA or protein molecules that interact with particular test RNAs. 
Most available di-hybrid or multiple-hybrid systems rely on protein components and are not 
readily adaptable to analysis of RNA-RNA interactions. Even RNA-protein interactions are 
often difficult to study with current systems; only the Wickens system has been developed for 
that purpose. The inventive demonstration of activating RNAs, however, allows ready 
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production of multiple-hybrid systems that can analyze RNA-RNA and RNA-protein 
interactions. For example, the test RNA can be produced as a single hybrid molecule with 
the riboactivator and then screened against either a protein library fused to a DNA binding 
domain (a two-hybrid system) or an RNA library fused to an interaction moiety that allows 
the RNA library to be recruited to DNA (a three-hybrid system). Without a riboregulator, 
RNA-RNA interactions could only be studied in "four-hybrid" systems, an impractical, if not 
infeasible arrangement. 

Inventive riboregulators may also be employed in multiple-hybrid experiment to 
identify RNAs that interact with a test protein, hi the simplest version of this experiment, the 
test protein is fused to a DNA binding domain and an RNA library is fused with the 
riboregulator. Such studies can lead to the development of "'RNA linkers" that can be used to 
bring any two proteins together. That is, once RNAs are identified that interact with first and 
second proteins, those RNAs are attached to one another (preferably by producing both 
linked together as a single RNA molecule) to produce an RNA linker. Such RNA linker 
molecules, although not necessarily riboregulators. are also within the scope of the present 
invention. 

Therapeutic reagents 

A large number of medical problems are caused by over- or under-expression of a 
particular gene, or by mutation of a gene. Many more can be alleviated by increased or 
decreased gene expression. The inventive riboregulators, which can be specifically targeted 
to activate or repress a selected gene, are useful as therapeutic agents. 
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For example, metastatic prostate cancer is currently treated with two drugs, 
leuprolide, which blocks testosterone production, and flutamide, which acts as an androgen 
receptor antagonist. These drugs are effective against prostate cancer because prostate cancer 
cells require hormone stimulation to grow and divide. Unfortunately, these drugs lose their 
effectiveness after about two years of treatment because resistant cancer cells arise. It is 
known that at least 80% of these resistant cells express the androgen receptor. Moreover, 
these cells become resistant to leuprolide and flutamide either by amplifying the androgen 
receptor gene so that much more receptor is made or by mutating that gene so that the 
receptor it produces is capable of stimulating cell proliferation even when it is not bound to a 
hormone ligand. According to the present invention, an improved method of treatment for 
prostate cancer would involve administration of an agent that represses the androgen receptor 
gene. Such an agent could be given alone or in combination with leuprolide and/or 
flutamide. An inventive riborepressor is such an agent. Preferably, the riborepressor is 
prepared in association with a DNA binding moiety that targets it to a regulatory site that is 
operatively linked to the androgen receptor gene. Alternatively or additionally, the 
riboregulator may be associated with an interaction moiety that mediates an association 
between the riboregulator and a DNA binding entity endogenous to the cells to which the 
riboregulator is delivered. The riboregulator is packaged into a pharmaceutical formulation 
according to known techniques and procedures, and is preferably delivered orally. 

As but one more example, inventive riboregulators are useful int he treatment of 
anemia and other disorders related to erythrocyte production. Erythropoietin (Epo), a protein 
agent that stimulates production of erythrocytes, is perhaps the most successful druo 
produced through biotechnology. Recent studies have shown that it is possible to make Epo 
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even more effective by co-administration of an agent, known as interleukin-l, that increases 
transcription of the erythropoietin receptor (EpoR). Unfortunately, interleukins have proven 
not to be generally useful as pharmaceutical agents because of their associated toxicities. 
The present riboactivators provide an alternative agent that could be used to stimulate EpoR 
gene expression. Because inventive riboactivators can be designed to be highly specific to 
the particular gene to be activated, they should not have the toxicity problems associated with 
interleukins, which have broad-ranging activities. 

It is appreciated that the structure of the inventive riboregulators may have to be 
modified in order to ensure their effectiveness as therapeutic agents. It may be necessary, for 
example, to increase the stability of the riboactivator in order to ensure that it persists Ions 
enough in vivo to have a significant effect on gene expression. It is known that addition of 2'- 
0-methyl. or-phosphorothiol groups increases the stability of RNA molecules in vivo. Also, 
circularized versions of the molecules may be more stable. Other modifications might be 
made in order to increase bioavailability of the riboregulators, and/or to increase the extent to 
15 which they are taken up by cells. Those of ordinary skill in the art will appreciate that any of 
a wide variety of modifications may readily be tested, and the resultant modified RNAs can 
be assayed as described herein to ensure that they retain their functionality. 



10 



Examples 

20 Example 1 

A Screen for the Identification of Riboregulators 
We have developed a novel two-hybrid transcriptional regulation system for use in 
the identification of riboregulators. In order to simplify the presentation of our system, we 
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first describe a prior art system that was prepared by Wickens and Fields to allow the 
identification of proteins that interact with known RNAs (SenGupta et al., Proc. Natl. Acad. 
Sci. USA 93:8496, 1996). 

The Wickens/Fields system, which is depicted in Figure 1, utilizes i) a first hybrid 
100 comprising a DNA binding moiety 200 (the lexA DNA binding domain) fused to an 
RNA binding moiety 300 (the MS2 coat protein); ii) a second hybrid 400 comprising a 
recruiting RNA 500 (the MS2 RNA) ftised to the bait RNA 600; iii) and a third hybrid 700 
comprising a target protein 900 fused to a transcriptional activation domain 900 (the Gal4 
activation domain). The DNA binding moiety 200 recognizes a site 1000 positioned 
upstream of a reporter gene 1100 and recruits the bait RNA 600 to that site because of the 
interaction between the RNA binding moiety 300 and the recmiting RNA 500; transcriptional 
activation results when the bait RNA 600 interacts with the target protein 700, so that the 
transcriptional activation domain 800 is brought to the DNA. As with other known two- 
hybrid (or multi-hybrid) transcriptional activation assay systems, the Wickens/Fields system 
is designed to screen protein libraries (specifically, libraries of target proteins) to identify 
those with a desired interaction capability (i.e., the ability to interact with the known bait 
RNA). 

We have modified the Wickens/Fields system as shown in Figure 2. Our system 
employs only two hybrids, one 100 comprising a DNA binding moiety 200 fused to an RNA 
binding moiety 300, and one 400 comprising a recruiting RNA 500 linked to a bait RNA 600 
(the riboregulator). In this system, the bait RNA 600 is recruited to the DNA because of the 
interaction between the RNA binding moiety 300, which itself is bound to DNA because of 
its linkage to the DNA binding moiety 200, and the recruiting RNA 500; transcriptional 
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activation results whenever the bait RNA 600 is a riboactivator; transcriptional repression 
results when the bait RNA 600 is a riborepressor. 

The inventive system depicted in Figure 2 differs from the Wickens/Fields system in a 
least two major ways: i) it does not rely on a third hybrid, containing a transcriptional 
activation domain, to mediate transcriptional regulatory effects but rather detects 
transcriptional regulation directed by the bait RNA itself; and ii) the bait RNA is a library of 
RNA molecule from which active RNAs are identified, rather than a known RNA that is used 
to identify active interacting proteins. 

hi the particular embodiment of the inventive system that is depicted in Figure 2, the 
DNA binding moiety 200 consists of a lexA DNA binding domain; the RNA binding moiety 
300 is the MS2 protein; and the recruiting RNA 500 is the MS2 RNA. In light of the above 
discussion, those of ordinary skill int he art will appreciate that each of these system 
components could readily be substituted with a different molecule (or group of molecules) 
that performs substantially the same function. For example, any molecule or compound that 
is capable of both (i) specific recognition of a DNA sequence and ii) interaction with an RNA 
binding moiety, when utilized under the conditions of the assay, is useful as a DNA binding 
moiety in accordance with the present invention. 

Also, any of a variety of molecules or compounds may be utilized as RNA binding 
moieties and recruiting RNAs in place of the MS2 protein and MS2 RNA depicted in Figure 
2. Useful RNA binding moieties must interact specifically with both the DNA binding 
moiety and the recruiting RNA under the conditions of the assay; useful recruiting RNAs 
must interact specifically with the RNA binding moiety and the bait RNA. 
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Any of a variety of different genes may be utilized as reporter genes, so long as their 
gene products are detectable. In screens for riboactivators, it is generally preferred that the 
RNAs be tested for their ability to activate at least one reporter gene that is an essential gene 
under the conditions of the assay (so that the assay is a selection, rather than a screen). In 
screens for riborepressors, it is preferred that the RNAs be tested for their ability to repress a 
gene whose expression is toxic under the conditions of the assay, hi such assays, it may also 
be desirable to arrange that the cell's sensitivity to the toxic gene be inducible, in order to 
avoid the problem of the cell being killed by expression of the gene before the potential 
riborepressors have had the opportunity to exert their effects. 

When using the screen depicted in Figure 2, an RNA is identified as a riboregulator if 
it exerts any reproducible effect (increase or decrease beyond that expected error) on 
transcription. Preferred riboactivators increase or decrease gene expression by at least 2-fold, 
preferably at least 5-fold, more preferably at least 10-20 fold, yet more preferably at least 100 
fold, at most preferably at least 1000 fold. 



Example 2 

Identification and Characterization of Riboactivators from an "RIO" Library 
Materials and Methods 

Strains and Media: The Z¥0-coast yeast strain (a derivative of the uraS L40 strain 
containing the lexA-MS2 fusion protein integrated into the genome; MATa, ura3-52, leu2-3, 
-112. his3A200, trplAl. ade2, LYS2::(lexA-op)-HlS3, ura3 : : (lexA-op)-LacZ, LexA.MS2 coat 
protein (TRPl)) was generously provided by Marvin Wickens. Rich (YPD) and synthetic (S) 
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media were prepared as described by Rose el al. {Methods in Yeast Genetics, Cold Spring 
Harbor Press, Cold Spring Harbor, NY, 1990). 

Construction of the RIO LraRARY: An oligonucleotide, called "01igo2" that 
included a 15nt stem-loop and a randomized 10-mer loop was synthesized. Oligo 2 had the 
sequence 5'- 

CTCTGGGAGCTGCGATTGGCAGAATTCCGGCTAGAACTAGTGGATCCCCCGGGCG 
AGGCTTATCCNNNNNNNNNNGGATGTGCTGACCCCGGGCAGCTTGCATGCCTGC 
AGGTCGACTCTAGAAAACATGAGGATCACCC (SEQ ID NO:l). Two other 
oligonucleotides, "Oligo 1" and "Oligo 3" were also prepared. The sequences of these 
oligonucleotides were selected so that they could be used to ampliiy Oligo 2 to produce a 
template library that, when transcribed, would produce the "RIO" library of RNA molecules. 
Oligo 1 and Oligo 3 had the following sequences: 5'-CTCTGGGAGCTGCGATTGGC (SEQ 
ID NO:2) and 5'-GGGTGATCCTCATGTTTTCT (SEQ ID NO:3), respectively. 

Oligo 2 was amplified in ten simultaneous polymerase chain reactions utilizing 72 uL 
water, 109 uL 10 X PCR buffer (Promega), 4 uL magnesium chloride (Promega, 2 uL 
nucleotide mix (10 mM each), 3 uL Oligo 1 (500 ng/uL), 3 uL Oligo 3 (500 ng/uL), 5 uL 
Oligo 2 (50 ng/uL), and 1 uL Taq DNA polymerase each. Following one cycle of 5 minutes 
at 94 °C, 2 minutes at 50 "C, and 3 minutes at 72 °C, thirty cycles of 1 minute at 94 °C, 2 
minutes at 50 °C, and 3 minutes at 72 °C were performed. A final 10 minute extension 
reaction at 72 °C was then performed. Products were purified over a G25 spin column and 
were ethanol precipitated. Products were then resuspended in sterile 1 X TE, to a final 
volume (firom all 10 reactions) of 100 uL. 
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Yeast Transformation. Yeast transformation was performed by the lithium 
acetate method (Rose et al.. Methods in Yeast Genetics, Cold Spring Harbor Press, Gold 
Spring Harbor, NY, 1990), exploiting the gap repair phenomenon described by Rothstein 
{Methods Enzymol, 194:281, 1991). 1 ^ig, 5 ^g, 10 pg, or 20 ^g of RIO library PGR product 
were introduced into yeast cells in combination with Smal-cut pni/MS2 vector. 
Transformants were selected on S media lacking the appropriate amino acids. 

X-Gal Plate Screen: Transformants were grown on white nitrocellulose filters 
placed on an appropriate selective medium. Subsequently, the filters were transferred to a 
synthetic medium containing 2% glycerol and 50 |ig/ml x-gal. Pale blue to dark blue 
colonies were identified, the plasmids they contained were isolated according the known 
procedures (Ward, Nuc. Acids Res., 1990), and the isolated plasmids were retransformed into 
the same strain and were retested for transcriptional activation activity. 

Liquid P-Galactosddase Assays: Transformants that turned blue on the X-gal 
plate screen were grown to an OD^oo of between 1 .0 and 2.0 in 5 mis of selective media 
containing 2% glucose, p-galactosidase activity was assayed as described (Rose et al.. 
Methods in Yeast Genetics, Gold Spring Harbor Press, Gold Spring Harbor, NY, 1990); units 
of activity were calculated as nmol/min/mg protein. Each assay was performed at least in 
duplicate on several occassions. 

GONSTRUCTION OF STRAIN FOR GAL4-MS2 EXPERIMENTS: A blaster plasmid (Alani et 
al., Genetics 1 16:541, 1987) was utilized to disrupt the TRP gene of the GGYl strain (Gill et 
al.. Nature 334:721, 1988) by integration of the URA3 gene. URA3 gene was deleted by 
recoving yeast colonies that could grow on plates including 5-FOA. 
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PRODUCTION OF Keene RNA-MS2 fusion: We ordered two oligonucleotides, Keene- 
1 (5'-CCGGGCGAGGCTTATCCTGGTGGAGCAGGATGTGCTGACC; SEQ IDN0;18) 
and Keene-2 (5'-CCGGGGTCAGCACATCCTGCTCCACCAGGATAAGCCTCGC; SEQ 



ID NO: 1 9). These oligos were annealed to one another and were cloned into the Xmal site of 

5 pMS2-2 (SenGupta et aL, Proc. Natl. Acad ScL USA 93:8496, 1 996). A number of clones 

were sequenced; one clone that had insert in the proper orientation was tested for 

transcriptional activation capability in yeast. 

production of TAR-MS2 FUSION: We ordered two oligonucleotides, Tar-1 (5'- 

CCGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAGA 
10 GAAC; SEQ ID NO:20) and Tar-2 (5'- 

CCGGGTTCTCTAGTTAGCCAGAGAGCTCCCAGGCTCAGATCTGGTCTAACCAGAG 
AGAC; SEQ ID NO:21). These oligos were annealed to one another and were cloned into 
the Xmal site of pMS2-2 (SenGupta et al., Proc. Natl Acad, Sci. USA 93:8496, 1996). A 
number of clones were sequenced; one clone that had insert in the proper orientation was 
15 tested for transcriptional activation capability in yeast. 

PRODUCTION OF NUMBER 5-Tar STEM-MS2 FUSION: We ordered two 
oligonucleotides, 5/Tar-l (5'- 

CCGGGTCTCTCTGGTTAGACCAGATCTGAGCCGGGATGCTCTCTGGCTAACTAGA 
GAAC; SEQ ID NO:22) and 5/Tar-2 (5'- 

20 CCGGGTTCTCTAGTTAGCCAGAGAGCATCCCGGCTCAGATCTGGTCTAACCAGAG 
AGAC; SEQ ID NO:23). These oligos were annealed to one another and were cloned into 
the Xmal site of pMS2-2 (SenGupta et al., Proc. Nad Acad. ScL USA 93:8496, 1996). A 
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number of clones were sequenced; one clone that had insert in the proper orientation was 
tested for transcriptional activation capability in yeast. 

FUSION OF NUMBER 7 LOOP TO 6-BASE PAIR SU4l STEM: We ordered two 
oligonucleotides, 7/Sma-l (5'-CCGGGTGCTGGATCAC; SEQ ID NO:24) and 7/W2 (5'- 
CCGGGTGATCCAGCAC; SEQ ID NO:25). These oligos were annealed to one another and 
were cloned into the Xmal site of pMS2-2 (SenGupta et al., Proc. Natl. Acad. Sci. USA 
93:8496, 1996). A number of clones were sequenced; one clone that had insert in the proper 
orientation was tested for transcriptional activation capability in yeast. 

Results 

In designing our riboactivator screening system, we first considered what length RNA 
we should screen. In order to answer this question, we considered what is known about RNA 
structure and protein-RNA interactions. One experimental system that has provided 
significant amount of information about the probability that random RNAs will 
structures, and also about certain characteristics of protein-RNA interactions, is the 
Systematic Evolution of Ligands by Exponential Enrichment (SELEX) system developed by 
Larry Gold and colleagues (reviewed in Gold et al,Annu. Rev. Biochem. 64:763, 1995). In 
this system, libraries of random nucleic acid molecules are synthesized in vitro and are 
screened, also in vitro, for their ability to interact with high affinity with particular targets. A 
sufficient number of SELEX experiments has been described to support the generalization 
that RNA transcripts with random base compositions are likely to assume some structure is 
they are at least 25 nucleotides long. These experiments have also shown that proteins that 
bind to 25 nt-long RNAs tend to interact with about 10-15 nts of the RNA. Although nothing 



a 
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in these experiments indicates that RNAs must be 10-25 nt long to have secondary structure, 
or to have protein binding capability, we decided to utilize RNAs within this size range in our 
original screen. 

In one particular SELEX experiment, researchers produced a library of RNA stem- 
loops and screened it to identify those that bind to an antibody at the antigen recognition site 
(Tsai et al., Proc. Natl. Acad ScL USA 89:8864, 1992). The RNA library was produced by 
inserting a random RNA 10-mer between two complementary sequences that could form a 
stable base-paired stem. Thus, all of the RNA molecules in the Hbraiy contained the same 
stem. In some cases, the inserted 10-mer probably included additional self-complementary 
nucleotides, so that the stem was further extended; in others, the 10-mer probably remained 
as a loop. 

Although it was clearly not our only option, we chose to utilize a stem-lOmer-loop 
strategy analogous to that described above in our initial screen for riboactivators. We 
prepared a library of RNA molecules, termed the "RIO library" that each comprised two 15- 
nt complementary sequences flanking a random 1 0-mer RNA. Use of this library was 
attractive for two reasons. First, because the random RNA in the library is relatively short in 
sequence, it is feasible to screen every possible molecule. Also, it has been reported that 10 
nt is the maximum size for an RNA loop in vivo (Wyatt et al. in The RNA World (Gestekand 
et al., eds.). Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., pp. 465-496, 
1993). Thus, when expressed in vivo, this library is likely to contain molecules with a variety 
of different stem-loop structures, ranging from structures with the minimum allowable loop 
size (thought to be 2 nt; Wyatt et al. in The RNA World (Gestekand et al., eds.). Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., pp. 465-496, 1993) to the maximum 
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allowable loop size (10 nt). Presumably, bulged nucleotides are also not uncommon in the 
RIO library. Given that many RNA binding proteins recognize portions of RNA molecules 
that include loops or bulges (Draper, Annu. Rev, Biochem. 64:593, 1995), we expected that 
many molecules in the RIO population would be capable of interacting with proteins. 

We transformed the RIO library into yeast, relying on the gap-repair method to 
integrate the library into a co-transformed plasmid containing the MS2 RNA so that, hybrid 
RNAs containing potential riboactivators linked to the MS2 RNA were produced in vivo. 
The yeast strain into which we transfomied the library contained a gene for the lexA-MS2 
fusion protein integrated in its genome, and also contained two different reporter genes, HIS3 
and LacZ, each of which is linked to a LexA operator. Transformants that contained 
riboactivators were selected for their abiUty to grow on a synthetic medium lacking histidine 
(i.e., for their ability to express HISS), Aminotriazole, a competitive inhibitor of the HISS 
gene product, was added to the medium to increase the stringency of the selection. Of 
approximately 8 million transformants, only approximately 200 expressed sufficient levels of 
HISS to survive on the selective medium. 

These 200 His+ transformants were screened to identify those that also expressed 
elevated levels of P-galactosidase; 12 were chosen as candidates after analysis by plate assay. 
Plasmids were isolated from these 12, were re-transformed into the same strain, and were re- 
assayed to ensure that the observed transcriptional activity was linked to the plasmid and did 
not reflect spurious mutation of the strain. 8 of these re-transformants proved to be plasmid- 
linked. We quantified the levels of activation that we were detecting by performing liquid p- 
galactosidase assays. We also isolated the plasmids from these 8 colonies and determined the 
nucleotide sequence of the RNAs they encoded. The P-galactosidase values and loop 
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sequences of our 8 positive clones are presented below in Table 1. As can be seen, there Wc 
a strong consensus of 5'-UGC(G>U>A)GG(U>A>C)(U>ACG)(C>A)(G>A>U)-3' (SEQ ID 
NO:4) among the activating RNAs. 



TABLE 1 


RIBOACTIVATOR ISOLATE 
NUMBER 


SEQUENCE 


P-GALACTOSIDASE 


control (no insert) 




0.2 


1 


5'-UGCGGGUACG 
SEQ ID NO:5 


23.0 


2 


5'-UUGCUGGCGA 
SEQ ID NO:6 


22.0 


3 


5'-UGCGGGUCAU 
SEQ ID N0:7 


40.0 


4 


5'-UGCGGGUUCG 
SEQ ID NO: 8 


32.0 


5 


5'-UGCGGGAUCA 
SEQ ID NO:9 


56.0 


6 


5'-UGCAGGUUCG 
SEQ ID NO: 10 


70.0 


7 


5'-UGCUGGAUCA 
SEQ ID NO: 11 


87.0 


8 


5'-UUGCUGGCGA 
SEQ ID NO: 12 


33.0 



In performing our sequencing studies, we found that several of our riboactivating 
RNAs had alterations in sequences outside of the engineered stem-loop structure. 
Specifically, riboactivators numbered 3, 5, 6, and 7 had a deletion of a sequence element that 
is normally repeated in the MS2 RNA. Riboactivator number 7 also had a single base 
substitution (C to T) 1 5 nucleotides dovmstream of the stem-lObp structure. 
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We also sequenced eight non-activating RNAs in order to be sure that our hbrary was 
not biased and that our riboactivator consensus truly reflected transcriptional regulatory 
activity rather than some other aspect of the library. Certain exemplary non-activator loop 
sequences are presented below: 



TABLE 2 


CONTROL RNA ISOLATE NUMBER 


SEQUENCE 


3 


5'-CACGGTAAGT 
SEQ ID NO: 13 


6 


5'-CAAAGACAGG 
SEQ ID NO: 14 


8 


5'-GGCTGGTGGT 
SEQ ID NO: 15 


10 


5'-GTAGAGCGA 
SEQ ID NO: 16 



Having identified our consensus sequence, we noticed that two known RNAs, the 
HIV Tar RNA and the RNA described above that was identified as having the ability to bind 
to an antibody raised against a particular 13-amino-acid peptide (Tsai et al., Proc. Natl Acad 
ScL USA 89:8864, 1992), have similar nucleotide sequences to our consensus. Each of these 
RNAs has a stem-loop structure. The Tar loop is similar to our consensus but its stem is 
unrelated; the stem of the other RNA, which we refer to as the "Keene RNA", is identical to 
ours but its loop is less similar to our consensus than is Tar's. We tested whether either HIV 
Tar or the Keene RNA act as a riboactivator in our system. We also tested whether our 
riboactivator number 5 would retain hs activation capability when positioned atop the Tar 
stem. 
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We found that neither Tar nor the Keene RNA activated transcription in our system. 
We further found that riboactivator number 5 retained significant activation capability when 
positioned atop the Tar stem. Table 3, below, summarizes our results with Tar. 



TABLE 3 



RNA 



vector 



number 5 



Tar 



number 5 
on Tar stem 



STEM SEQUENCE 



G-C 
U-A 



C-G 
G-C 



C-G 

G-C 



LOOP SEQUENCE 



CGGGAU 



CUGGGA 



CGGGAU 



P-GAL 



60 



20 



We also tested whether riboactivator number 7 would retain its activity when 
positioned atop a different stem. Specifically, we placed the number 7 loop at the top of a 6- 
basepair stem comprising a Smal site. As shown below in Table 4, we found that 
riboactivator number 7 retained significant activation capability when presented at the top of 
this Smal stem. 



TABLE 4 


RNA 


P-GALACTOSIDASE 


number 7 


80 


number 7 on Smal stem 


40 


control 


0.1 



In order to confirm that our riboactivators were working as expected, we transformed 
them into a control strain lacking the lexA-MS2 hybrid and confirmed that, in the absence of 
this hybrid (which is required to recruit the riboactivators to the DNA), the LacZ sene 



was 



not activated. 
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We performed three different experiments to ensure that our RNA molecules were 
responsible for the activation that we were observing. First, we re-cloned the 40 basepair 
segment encoding activators 7 and 5 into a fresh vector (containing both copies of the MS2 
repeat and also lacking the base substitution that was present in original activator number 7) 
and confirming that the re-cloned activators retained transcriptional regulatory activity. 
Next, we re-cloned activators 5 and 7 in the "wrong" orientation and established that such 
"reverse" activators did not stimulate transcription. Finally, we made sure that our 
riboactivators could stimulate transcription when recruited to DNA by a DNA binding moiety 
other than the lexA DNA binding domain. 

Specifically, we introduced the plasmids encoding our riboactivator/MS2 bindino 
domain RNAs into a strain expressing a Gal4-MS2 hybrid and containing a reporter gene in 
which 5 Gal4 operators were positioned upstream of the iacZ gene. In this experiment, as 
shown below in Table 5, we found that our riboactivators gave the following levels of P- 
galactosidase expression: 



TABLE 5 


RIBOACTIVATOR ISOLATE NUMBER 


P-GALACTOSIDASE 


control (no insert) 


20 


1 


43 


2 


38 


3 


37 


4 


42 


5 


58 


6 


58 


7 


75 


8 


46 
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In order to test whether our riboactivators functioned as "acidic activators", we re- 
transformed isolate number 7 into the L40 coat strain in combination with either a control 
vector (pFL46L) or a vector over-expressing Gal 4 (pSS272-3). We tested the expression of 
5 the LacZ gene in the restraint strains by liquid P-galactosidase assay and found the following: 



TABLE 6 


STRAIN 


P-GALACTOSIDASE 


control 


30 


Gal4 overexpression 


9 



10 

As can be seen, our riboactivator was squelched by Gal 4. 

We also wanted to determine whether our riboactivators had sequences that were 
present elsewhere in the yeast genome. Since the entire yeast genome has been sequenced, 
this question can be answered definitively with a simple computer database search. We 
15 searched with both the exact sequence and its reverse complement for riboactivator number 
7, The exact sequence was present in 12 locations, one of which was the SWJl gene, which 
encodes a component of the SWI/SNF global transcription activator complex. The reverse 
complement was present at 14 locations. Other riboactivator sequences were also found in 
the yeast genome, as summarized below in Table 7. 

20 



TABLE 7 


ACTIVATOR 


GENE NAME 


ACCESSION 


GENE 


ACTIVATOR 






NO. 


FUNCTION/ 


SEQUENCE 








STRUCTURE 


LOCATION 


1 


ALD7 


Z75282 


aldehyde 
dehydrogenase 


1 5 1 nt from end 
of ORE 
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5 



1 


SENS 


Z37993 


26s proteosome 

regulatory 

subvmit 


1685 ntfrom 
beginning of 
ORF; 1149 


2 


YML093W 


Z46660 


similar to P. 
falciparum liver 
stage antigen 
LSA-1 


42 1 nt from end 
of ORF 


2 


YMR006C 


Z49810 


hypothetical 
protein contains 
prenyl group 
binding (CAAX 

box) 


1477 nt from 
end of ORF 


2 


YOR144C 


Z75052 


weak similarity 
to human DNA 
binding protein 
PO-GA 


1070 ntfrom 
start of ORF 


3 


YOR064 


Z74972 


weak similarity 
to human 
retinoblastoma 
bindincT nrotein 
2 


89 nt past ORF 


J 


YBR044C 


Z35913 


similar to 
chaperonin 
HSP60 proteins 


634 nt from 
start of ORF 


4 


PIE2 


Z75017 


phosphotidyl- 

inositol 

phosphate 

phosphatase; 

transmembrane 


1489 ntfrom 
end of ORF 


4 


YOR306C 


Z75214 


similar to 
human X- 
linked pest- 
containing 
transporter 


634 nt from 
start of ORF 


5 


ECM2 


Z35934 


involved in cell 
wall and plasma 
membrane 
biogenesis 


268 nt from 
beginning o f 
ORF 
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5 


SCYl 


Z72605 


suppressor of 
GTPase mutant 


1 89 nt from end 
ofORF 


5 


YDL057W 


Z74105 


hypothetical 
protein 


58 nt after end 
ofORF 


6 


YNL187W 


Z71463 


hypothetical 
protein 


14 nt from end 
of ORF 


6 


YPRlOSc 


U32445 


hypothetical 
transmembrane- 
spanning 
protein 


444 nt from end 
of ORF 


6 


YATl 


X74553 


carnitine acetyl 

transferase; 

mitochondrial 

transmembrane 

protein 


363 nt from 
start of ORP 


7 


YLL013C 


Z73 1 1 8 


similar to 
Drosophila 
pumilio protein 


137 nt after 
bet'inninf? of 
ORF 


7 


TFC6 


U28372 


transcription 
factor; IIlc 
subunit Tau 91 


345 nt after 
beginning of 
PRF 


7 


YLR376C 


U19103 


hypothetical 
iransmembrane- 
spanning 
protein 


526 nt past 
beffinninp of 
ORF 


7 


YBR203W 


Z36072 


hypothetical 
protein 


1 3 1 0 nt past 
ORF start 


7 


YGR257C 


X99228 


hypothetical 
protein with 
mitochondrial 
energy transfer 
protein 
signature 


68 nt past ORF 
start 


7 


MIPl 


Z28134 


mitochondrial 

intermediate 

peptidase; 

transmembrane- 

spanning 

sequence 


300 nt from end 
of ORF 
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5 



7 


YNL033W 


Z71309 


hypothetical 
transmembrane- 
spanning 
protein 


6 1 5 nt past 
beginning of 
ORF 


7 


NUMl 


Z50046 


nuclear 

migration 

protein 


6000 nt from 
start and 2000 
nt from end of 
ORF 


7 


YOL092W 


Z74834 


protein similar 
to YBR147; 
transmembrane- 
spanning 
sequence 


69 nt from start 
of ORF 


7 


SWIl 


U33335 


component of 

SWI/SNF 

global 

transcriptional 

activator 

complex 


1600 nt from 
start of ORF 


7 


RFI2 


Z49919 


chromatin 
assembly 
complex, 
subunit p90 


296 nt before 
start of ORF 


7 


YPR121W 


U40828 


similar to B. 
subtilis 

transcriptional 
activator tenA 


152 nt past 
beginning of 
ORF 



EXAMPLE 3 
Identification of Riboactivators from "R40" library 

10 Results 

We wanted to prepare an RNA library that was not constrained, as is the above- 
described RIO library, by having all of its members contain the same base stem. Also, we 
were interested in preparing a library with a longer region of variability. Szostak and 
Ellington have recommended the use of RNAs 30-60 nts in length for formation of compl 
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Structures such as multiple stem-loops, bulges, pseudoknots, and combinations thereof 
(Szostak et aL In The RNA World (Gesteland et al., eds). Cold Spring Harbor Laboratory 
Press, Cold Springs Harbor, NY, pp. 511-33. We therefore designed an oligonucleotide 
containing 40 randomized nucleotides flanked by the same sequences that flanked the stem- 
5 loop structure in the RIO library, so that the same primers could be used to amplify the R40 
library as were used to amplify the Rl 0 library. The R40 oligonucleotide has the sequence 
5'- 

CTCTGGGAGCTGCGATTGGCAGAATTCCGGCTAGAACTAGTGGATCCCCCNNNNN 

NNN^^^^NNNNN^ 

10 CAGGTCGACTCTAGAAAACATGAGGATCACCC (SEQ ID NO: 1 7). The R40 library 
will then be transformed into yeast and screened as was the RIO library. 



GEL MOBILITY SHIFT ASSAYS: Yeast nuclear extract was prepared according to 
standard procedures and was stored in Buffer A (25 mM Tris pH 7.6, 5 mM MgCU, 0. 1 mM 
EDTA pH 8.0, 80 mM KCI, 10% gycerol, 1.0 mM DTT, 0.48M (NH4).S04, ImM ZnCI). 
Radiolabeled riboactivator number 7 and control RNA number 1 0 were prepared in the same 



EXAMPLE 4 



Identification of a Riboregulator Interaction Target 



15 



Materials and Methods 



20 



buffer. Each RNA was incubated with yeast nuclear extract for 10 minutes at room 



temperature and the mixtures were separated on an acrylamide gel under non-denaturing 



conditions. 
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NORTHWESTERN ASSAYS: Northwestern blotting was performed according to Kwon et 
al. {Dev. Biol. 158:90, 1993). Briefly, 5-60 Mg of protein extract was separated by 
SDS/PAGE and transferred to PVDF membranes (Biorad). Proteins were allowed to renature 
and then membranes were probed with radiolabeled number 7 riboactivator loop sequence 
(CCCGGGUGCUGGAUCACCCGGG; SEQ ID NO:26). After extensive washings, the 
filters were exposed to X-ray film so that protein-RNA interactions were detected. 

Results 

Our gel mobility shift assays indicated that riboactivator number 7, but not control 
RNA number 1 0, participated in a high molecular weight complex when incubated in yeast 
nuclear extract. Some complexes still formed when denatured extract was employed. 

Northwestern analysis demonstrated that riboactivator number 7 bound to a protein in 
yeast nuclear extracts that was the size of yeast TBP (Figure 3). Further analyses with 
purified proteins revealed that number 7 interacts with yeast TBP, but not with yeast TFIID 
lacking TBP, with yeast RNA polymerase 11 holoenzyme, or with mammalian TFIID (Figure 
4). The Keene RNA, which does not activate transcription (see Example 2 above), also 
appears to bind to a protein in yeast nuclear extract that is the same size as TBP. 

Other Embodiments 

Those of ordinary sill in the art will recognize that the foregoing constitutes a 
description merely of certain preferred embodiments of the invention and is not intended to 
define or limit the scope of the following claims. Various changes, substitutions 
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modifications, and extensions can readily be achieved without undue experimentation and are 
intended to be within the scope of the claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Jarrell Ph.D., Kevin A. 
Saha Ph.D., Shamol 
Ptashne Ph.D., Mark 



^Sfi T^I^ ^""^^"^ TRANSCRIPTIONAL REGULATORS AND 



(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS 

(A) ADDRESSEE: Choate, Hall & Stewart 

(B) STREET: 53 State Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER; IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #i.O, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 26-AUG-1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Jarreil Ph.D., Brenda H. 

(B) REGISTRATION NUMBER: 39,223 

(C) REFERENCE/DOCKET NUMBER: 034794 1 -003 1 

(ix) TELECOMMUNICATION INFORVUTION- 

(A) TELEPHONE: (617) 248 5000 

(B) TELEFAX: (6 1 7) 248 4000 



(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE; 

(B) CLONE: Oligo 2, used to produce RIO library 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 
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CTCTGGGAGC TGCGATTGGC AGAATTCCGG CTAGAACTAG TGGATCCCCC GGGCGAGGCT 6( 
TATCCNNNNN NNNNNGGATG TGCTGACCCC GGGCAGCTTG CATGCCTGCA GGTCGACTCT 12 
AG AAAACATG AGGATCACCC 1 40 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo i, used to create RIO library 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO::: 
CTCTGGGAGC TGCGATTGGC 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo 3 used to create RIO library 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
GGGTGATCCT CATGTTTTCT 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS" 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator consensus 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
UGCDGGHNMD 

(2) INFORMATION FOR SEQ ID NO;5: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 1 sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5; 
UGCGGGUACG 

(2) INFORiVL\TION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator number 2 sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
UUGCUGGCGA 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 3 sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
UGCGGGUCAU 10 
5 (2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

1 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



20 



35 



55 



60 



(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 4 sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
UGCGGGUUCG 10 
25 (2) INTORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Ribactivator 5 sequence 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
UGCGGGAUCA 10 
45 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 6 sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
UGCAGGUUCG 

(2) INFORMATION FOR SEQ ID NO: 11 : 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 7 sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 : 
UGCUGGAUCA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Riboactivator 8 sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
UUGCUGGCGA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Non-riboactivator 3 sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACGGTAAGT 
5 (2) INFORMATION FOR SEQ ID NO: 1 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

1 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

15 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Non-riboactivator 6 sequence 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 4: 
CAAAGACAGG 
25 (2) INFOR^L^TION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

35 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Non-riboactivator 8 sequence 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCTGGTGGT 
45 (2) INFORMATION FOR SEQ ID NO: 1 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

55 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Non-riboactivator 10 sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTAGAGCGA 

(2) INFORMATION FOR SEQ ID NO: 1 7: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED>;eSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: R40 oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 7: 
CTCTGGGAGC TGCGATTGGC AGAATTCCGG CTAGAACTAG TGGATCCCCC NNNNNWNNN 60 
NNWiNNNNNN NNNNNNNNNN NNNNNNNTn^^ GGGCAGCTTG CATGCCTGCA GGTCGACTCT 120 
AGAA A ACATG AGGATCACCC , 40 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinele 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Keene-1 oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 8: 
CCGGGCGAGG CTTATCCTGG TGGAGCAGGA TGTGCTGACC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinsle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 
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(B) CLONE: K.eene-2 oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGGGGTCAG CACATCCTGC TCCACCAGGA TAAGCCTCGC 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



20 (vii) LMMEDIATE SOURCE: 



CTGGTTAGAC CAGATCTGAG CCTGGGAGCT CTCTGGCTAA CTAGAGAAC 59 



(2) INFORMATION FOR SEQ ID NO:2 1 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinsle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Tar-2 oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CCGGGTTCTC TAGTTAGCCA GAGAGCTCCC AGGCTCAGAT CTGGTCTAAC CAGAGAGAC 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) LMMEDIATE SOURCE: 



(B) CLONE: Tar-1 oligonucleotide 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
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(B) CLONE: 5/Tar-I oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CCGGGTCTCT CTGGTTAGAC CAGATCTGAG CCGGGATGCT CTCTGGCTAA CTAGAGAAC 59 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucieic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucieic acid 

(vii) nvnviEDIATE SOURCE: 

(B) CLONE: 5/Tar-2 oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

CCGGGTTCTC TAGTTAGCCA GAGAGCATCC CGGCTCAGAT CTGGTCTAAC CAGAGAGAC 59 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: 7/Sma-I oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CCGGGTGCTG G ATCAC j ^ 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: 7/Sma-2 oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CCGGGTGATC CAGCAC 
(2) INFORMATION FOR SEQ CD NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Probe of riboaciivaior 7 loop 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CCCGGGUGCU GGAUCACCCG GG 
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Claims 

1 . A transcriptional regulator comprised of RNA. 

2. A transcriptional regulator comprising: 
a DNA-binding moiety; and 

an RNA linked to the DNA binding moiety, the RNA having transcriptional 
regulatory activity. 

3. The transcriptional regulator of claim 2 wherein the DNA-binding moiety is selected 
from the group consisting of nucleic acids, polypeptides, intercalation compounds, and 
chemicals that demonstrate specific DNA binding. 

4. The transcriptional regulator of claim 2 wherein the regulatory RNA is characterized 
by an ability, when recruited to a DNA site operationally linked to a promoter, to activate 
transcription from that promoter at least two-fold. 

5. The transcriptional regulator of claim 2 wherein the regulatory RNA is characterized 
by an ability, when recruited to a DNA site operationally linked to a promoter, to activate 
transcription from that promoter to a level at least half that observed when the promoter is 
activated by Gal4 bound to a similarly-positioned site. 

6. The transcriptional regulator of any one of claims 2, 4, or 5, wherein the regulatory 
RNA has a structure comprising a stem-loop. 



-44- 



suBsnruTE sheet (rule 25) 



wo 99/10487 PCT/US98/17691 

7. The transcriptional regulator of claim 6 wherein the stem includes at least about 6 
base pairs. 

8. The transcriptional regulator of claim 6 wherein the stem includes at least about 1 0 
5 basepairs. 

9. The transcriptional regulatory of claim 7 wherein the regulatory RNA has a sequence 
comprising 5'-UGC(G>U>A)GG(U>A>C)(U>ACG)(C>A)(G>A>U)-3' (SEQ ID NO:4). 

10 10. The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator. 

when recruited to a site operationally linked to a promoter, increases transcription from that 
promoter. 

1 1 . The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator, 
15 when recruited to a site operationally linked to a promoter, decreases transcription from that 

promoter. 

12. The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator 
affects transcription initiation. 



20 



1 3 . The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator 



affects elongation. 
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14. TTie transcriptional regulator of claim I or claim 2, which transcriptional regulator 
affects a process selected from the group consisting of reinitiation, termination, and pausing. 

15. The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator is 
acrive in one or more cell type selected from the group consisting of bacterial cells, yeast 
cells, mammalian cells, insect cells, plant cells, reptile cells, celenorate cells, and protozoan 
cells. 



1 6. The transcriptional regulator of claim 1 5 wherein the regulator is active in yeast, 
mouse, or human cells. 



1 7. The transcriptional regulator of claim 1 6 wherein the regulator is active in yeast and 
human cells. 



1 8. The transcriptional regulator of claim 1 or claim 2 wherein the regulator is active at 
more than one gene in a given cell. 

1 9. The transcriptional regulator of claim 1 0 wherein the regulator's activity is squelched 
in yeast cells by over-expression of Gal4. 

20. The transcriptional regulator of claim 10 wherein the regulator functions as an acidic 
activator. 
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21 . The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator is 
constitutively active. 

22. The transcriptional regulator of claim 1 or claim 2, which transcriptional regulator is 
conditionally active. 

23 . An RNA transcriptional regulator identified by a method comprising steps of: 
providing a library of RNA molecules linked to an RNA comprising recruiting 

moiety; 

expressing the linked library of RNA molecules in a cell that also expresses an 
interacting moiety linked to a DNA binding moiety that specifically recognizes a site 
operatively linked to a promoter directing expression of a detectable gene, which interacting 
moiety associates with the recruiting moiety; 

detecting an increase or decrease in expression of the detectable gene; and 
designating those RNA molecules present in cells in which expression of the 
detectable gene is increased or decreased as riboregulators. 



24. 

of: 



A method of identifying RNA transcriptional regulators, the method comprising steps 



providing a population of RNA molecules; 

delivering the RNA molecules to a location operationally linked to a promoter; and 
identifying those RNA molecules within the population that alter the rate or amount 
of transcription from the promoter. 
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25. The method of claim 24 wherein the step of identifying comprises identifying those 
RNA molecules that effect at least a two-fold change in the rate or extent of transcription 
from the promoter. 

26. The method of claim 24 wherein the step of identifying comprises identifying those 
RNA molecules that effect at least a five-fold change in the rate or extent of transcription 

from the promoter, 

27. The method of claim 24 wherein the step of identifying comprises identifying those 
RNA molecules that effect at least a ten-fold change in the rate or extent of transcription from 
the promoter. 

28. The method of claim 24 where in the step of identifying comprises identifying those 
RNA molecules that effect at least a one hundred-fold change in the rate or extent of 
transcription from the promoter. 

29. The method of claim 24 where in the step of identifying comprises identifying those 
RNA molecules that effect at least a one thousand-fold change in the rate or extent of 
transcription from the promoter. 

30. A therapeutic composition comprising: 

an RNA molecule characterized by an ability to alter transcription from a promoter 
when recruited to a site operatively linked to that promoter; and 
a pharmaceutically acceptable carrier. 
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10 



are 



31. A method of treating an individual suffering from a disorder whose symptoms ; 
alleviated or cured by effecting a change in expression of a gene, the method comprising 
steps of: 

identifying an individual suffering from a disorder whose symptoms are alleviated or 
cured by effecting a change in expression of a gene; and 

administering to the individual an RNA molecule characterized in that, when the 
RNA molecule is recruited to a site operationally linked to the gene promoter, it effects the 
change in expression of the gene. 



32. A method of identifying a target RNA that interacts with a test RNA, the method 
comprising steps of: 

providing a first hybrid RNA molecule comprising a riboregulator linked to a test 

RNA; 

providing a library of second hybrid RNA molecules, each of which comprises a 
15 potential target RNA linked to an interacting RNA; 

providing a third hybrid polypeptide molecule comprising an interacting polypeptide, 
which interacting polypeptide associates with the interacting RNA, linked to a DNA binding 
moiety, which DNA binding moiety binds to a site operationally linked to a promoter 
directing expression of a detectable gene; 
20 expressing the first, second, and third hybrids in a cell also containing the gene under 

control of the promoter and operationally linked to the site; 

detecting those cells in which expression of the gene is altered; and 

classifying the potential target RNA of the library member in the detected cell as an 
RNA that interacts with the test RNA. 
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33. A method of identifying a target RNA that interacts with a test RNA, the method 
comprising steps of: 

providing a library of first hybrid RNA molecules, each of which comprises a 
riboregulator linked to a potential target RNA; 

providing a second hybrid RNA molecule, comprising a test RNA linked to an 
interacting RNA; 

providing a third hybrid polypeptide molecule comprising an interacting polypeptide, 
which interacting polypeptide associates with the interacting RNA, linked to a DNA binding 
moiety, which DNA binding moiety binds to a site operationally linked to a promoter 
directing expression of a detectable gene; 

expressing the first, second, and third hybrids in a cell also containing the gene under 
control of the promoter and operationally linked to the site; 

detecting those cells in which expression of the gene is altered; and 

classifying the potential target RNA of the library member in the detected cell as an 
RNA that interacts with the test RNA. 



34. A method of identifying a target protein that interacts with a test RNA, the method 
comprising steps of: 

providing a first hybrid RNA molecules, comprising a riboregulator linked to a test 

RNA; 

providing a library of second hybrid polypeptide molecules, each of which comprisi 
a potential target polypeptide linked to a DNA binding polypeptide, which DNA binding 
polypeptide binds to a site operationally linked to a promoter directing expression of a 
detectable gene; 
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expressing the first and second third hybrids in a call also containing the gene under 

control of the promoter and operationally linked to the site; 

detecting those cells in which expression of the gene is altered; and 

classifying the potential target polypeptide of the library member in the detected cell 

as a target polypeptide that interacts with the test RNA. 

35 . A method of identifying a target RNA that interacts with a test polypeptide, the 
method comprising steps of: 

providing a library of first hybrid RNA molecules, each of which comprises a 
riboregulator linked to a potential target RNA; 

providing a second hybrid polypeptide molecule comprising a test polypeptide linked 
to a DNA binding moiety, which DNA binding moiety binds to a site operationally linked to 
a promoter directing expression of a detectable gene; 

expressing the first and second hybrids in a cell also containing the gene under control 
of the promoter and operationally linked to the site; 

detecting those cells in which expression of the gene is altered; and 

classifying the potential target RNA of the library member in the detected cell as a 
target RNA that interacts with the test polypeptide. 

36. An RNA linker molecule comprising a first portion that interacts with a first 
interaction partner and a second portion that interacts with a second interaction partner. 



-51- 



SUBSTITUTE SHEET (RULE 26) 



THIS PAGE BUNK (uspto) 




SUBSTITUTE SHEET (RULE 26) 



THfS PAGE BIANK mm) 




SUBSTITUTE SHEET (RULE 26) 



THfS PAGE BUNK (uspto) 



wo 99/10487 



PCTAJS98/17691 



3/4 



YNE 



33K- 



YEAST TBP 



FIG. 3 



SUBSTITUTE SHEET (RULE 26) 



THfS PAGE BUNK (uspto) 



wo 99/10487 



PCT/US98/17691 



4/4 



1 2 3 4 5 



33K 




LANE 1 MARKER 

LANE 2 MAMMALIAN TFIID 

LANE 3 YEAST TBP 

LANE 4 YEAST TFIID (-TBP) 

LANE 5 YEAST POL II HOLOENZYME 



FIG. 4 



SUBSTITUTE SHEET (RULE 26) 



THIS PAGE BLAMK 



INTERl^frriONAL SEARCH REPORT 



Internatii AppHcation No 

PCT/US 98/17691 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/11 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification svmbolsl 

IPC 6 C12N 



Documentation searched other than minimum documentation to the exXenl that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C^DOCUIWENTS CONSIDERED TO BE RELEVANT 



Category ' 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



D.J. SENGUPTA ET AL. : "A three-hybrid 
system to detect RNA-protein interaction 
in vivo" 

PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES USA, 

vol. 93, 1996, pages 8496-8501, 
XP002038710 

*see the whole article* 

U. PUTZ ET AL : "A tri -hybrid system for 

the analysis and detection of RNA-protein 

interactions" 

NUCLEIC ACIDS RESEARCH, 

vol. 24. no. 23, 1996, pages 4838-4840, 

XP000198724 

♦see the whole article* 

-/-- 



Further documents are listed in the continuation of box C. 



[X I Patent family members are (isted in annex. 



* Special categories of cited documents : 

"A" docurnent defining the general state of the art which is not 
considered to be of particular relevance 

"E" earfier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibitioner 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



later document published after the intemationa! filing date 
or priority date and not in conflict with the application but 
cited to understand the prrncipie or theory underlying the 
invention 

document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

3 May 1999 


Date of mailing of the international search report 

18/05/1999 


Name and mailing address of the ISA 

European Patent Office. P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31- 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Marie, A 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



InternatK Application No 

PCT/US 98/17691 



C.(Contlnuatlon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to clainn No. 



wo 96 29429 A (WISCONSIN ALUMNI RESEARCH 
FOUNDATION) 26 September 1996 
*see the whole patent* 

S.A. RICE ET AL.: "Bacterial reverse 
transcriptase and msDNA" 
VIRUS GENES, 

vol. 11, no. 2/3, 1996, pages 95-104, 
XP002101801 

*see the whole article* 

G. GILL AND M. PTASHNE: "Negative effect 
of the transcriptional activator GAL4" 
NATURE, 

vol. 334, no. 6184, 1988, pages 721-724, 
XP000002975 

*see the whole article* 

E. R. JUPE ET AL.: "The 3 'untranslated 
region of prohibitin and cellular 
immortal izati on" 

EXPERIMENTAL CELL RESEARCH, 

vol. 224, 1996, pages 128-135, XP000609312 

*see the whole article* 

F. RATINEJAD ET AL.: "Tumor suppression 
by RNA from the 3 'untranslated region of 
al f a-tropomyos i n " 

CELL, 

vol. 75, 1993, pages 1107-1117, 
XP002101802 

*see the whole article* 

M.D. CRESPI ET AL. : "enod40, a gene 
expressed during nodule organogenensis, 
codes for a non-translatable RNA involved 
in plant growth" 
EMBO JOURNAL, 

vol. 13, no. 21, 1994, pages 5099-5112, 
XP002101803 

*see the whole article* 



19 



1-36 



1-36 



1-36 



Form PCT/lSA/210 (continuation of second sheet) (July 1992) 



page 2 of 



2 



INTEF^PriONAL SEARCH REPORT 

Information on patent famHy members 



Patent document 
cited in search report 



Publication 
date 



Intornatl Application No 

PCT/US 98/17691 



Patent family 
member(s) 



Publication 
date 



WO 9629429 



26-09-1996 



US 
AU 
EP 
JP 
US 
US 



5610015 
3959795 
0765403 
10500865 
5677131 
5750667 



11- 03-1997 
08-10-1996 
02-04-1997 
27-01-1998 
14-10-1997 

12- 05-1998 



Form PCT/ISAy2lO (patent family annex) (July 1992) 



THIS PAGE BLANK (uspto) 



.-7 



D 



PATENT COOPERATION TREATY '^fcd ii 

• '^'=^0 U JUL 2000 

PCX jI^ 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 
(PCT Article 36 and Rule 70) 



Per J 



Applicant's or agent* s file reference 
0342941-0031 


FOR FURTHER ACTION Notification of Transmittal of International 

Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US98/17691 


International filing date (day/month/year) 
26 AUGUST 1998 


Priority date (day/month/year) 
27 AUGUST 1997 


International Patent Classification (IPC) or national classification and IPC 
IPC(7): C12N 15/11 and US CL: 536/23.1 


Applicant 

PRESIDENT AND FELLOWS OF HARVARD COLLEGE 



This international preliminary examination report has been prepared by this Intemational Preliminary 
Examining Authority and is transmitted^to the applicant according to Article 36. 



This REPORT consists of a total 



of^ 



. sheets. 



I I This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority, 
(see Rule 70.16 and Section 607 of Jhe Administrative Instructions under the PCT). 



These annexes consist of a total of 



sheets. 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 


□ 


IV 


□ 


V 




VI 


□ 


vn 


□ 


VIII 





citations and explanations supporting such statement 
Certain documents cited 



Date of submission of the demand 
25 MARCH 1999 


Date of completion of this report 
25 MAY 2000 


Name and mailing address of the IPEA/US 

Commissioner of Patents and Trademuks 
Box PCT 

Washington, D C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer 

JOyCEBfUOGERS 
RAM R SHUKLA MMLEGAL SPECML^ 

Telephone No. (703) 30S-0\9f^ ^Au^IH — ^ 



Form PCT/IPEA/409 (cover sheet) (July 1998)* 



THfS PAGE BLANK (uspto) 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCTAJS98/17691 



L Basis of the report 



1 . With regard to the elements of the intenwriofial application:* 
[3r| the international application as originally filed 
the description: 



pages 
pages 
pages 



1-43 



NONE 



, as originally filed 

, filed with the demand 



NONE 



, filed with the letter of 



[ x| the claims: 



pages 
pages . 
pages . 
pages . 



44-51 



NONE 



NONE 



, as originally filed 

, as amended (together with any statement) under Article 1 9 
, filed with the demand 



NONE 



filed with the letter of 



[3c] the drawings: 



pages 
pages 
pages 



1-4 



NONE 



, as originally filed 

, filed with the demand 



NONE 



the sequence listing part of the description: 
pages _ NONE 



filed with the letter of 



pages 
pages 



NONE 



, as originally filed 

, filed with the demand 



NONE 



, filed with the letter of . 



2. With regaid to the language, all the elements marked above were available or furnished to this Authoiity in the language in which 
the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language vviiich is: 

I I the language of a translation furnished for the purposes of intemational search (imder Rule 23.1(b)). 
I I the language of publication of the intemational application (under Rule 48.3(b)). 

I I the language of the translation furnished for the purposes of intemational preliminary examination (tinder Rules 55.2 and/ 
or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the intemational application, the intemational 
preliminary examination was carried out on the basis of the sequence listing: 

I ^1 contained in the intemational application in printed form. 

px] filed together with the intemational application in computer readable form. 

I I furnished subsequently to this Authority in written form. 

[ I furnished subsequently to this Authority in computer readable form. 

□ The statement that the subseouently furnished written sequence listing does not go beyond the disclosure in the 
intemational application as filed has been furnished. 

□ The stateinent that the information recorded in conq)uter readable form is identical to the writen sequence listing has 
be^ furriishedL 

^ I x| The amendments have resulted in the cancellation of: 

CEl the description, pages NONE 

[3 the claims, Nos. NONE 

fx] the drawings, sheets/fig NONE 



5. [x] This report has been drawn as if (some oO the amendments had not been made, since they have been considered to go 

beyond the disclosure as filed, as indicated in the Si:5jplemental Box (Rule 70.2(c)).** 
♦ Replacement sheets which have been furmshed to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally fAed" and are not annexed to this report since they do not contain amendments (Rules 70.16 
and 70,17). 

**Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this report. 

Form PCT/IPEA/409 (Box I) (July 1998)* 



THrS PAGE BLANK (uspto) 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/17691 



V. Reasoned statement under Article 35(2) with regard to novelty, invenHve step or industrial appUcabilitv- 
citations and explanations supporting such statement 



1. statement 

Novelty (N) 

Inventive Step (IS) 
Industrial Applicability (lA) 



Claims 4-18. 20-36 



Claims 1-3, 19 



Claims 4-18. 20-36 



Claims 1-3. 19 



Claims 1-36 



Claims NONE 



YES 
NO 

YES 
NO 



YES 
NO 



2. citations and explanations (Rule 70.7) 

Claims 1-3 lack novelty under POT Article 33(2) as being anticipated by Scngupta et al because Sengupta et al teach a 
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Claims 1-36 meet the criteria set out in PCT Article 33(4) for industria applicability. 



NEW CITATIONS 



NONE 



Form PCT/IPEA/409 (Box V) (July 1998)* 




THfS PAGE BUWK (uspto) 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



Internationa] application No. 
PCT/US98/17691 



VnL Certain observations on the international application 



The following observations on the clarity of the claims, description, and drawings or on the question whether the claims are fully 
supported by the description, are made: 

The description is objected to under PCT Rule 66.2(aXv) as lacking clarity under PCT Article 5 because it fails to adequately 
enable practice of the claimed invention because: the invention does not disclose the methods of gene therapy and whether gene 
therapy methods would have worked in treating any and all diseases as claimed. 

Claims 30 and 31 are objected to as lacking clarity under PCT Rule 66.2(aXv) because practice of the claimed invention is not 
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